Evaluating the Proportional Hazards Assumption - Methods
We have four ways to assess PH assumption
- log-log survival curves
- observed versus expected survival curves
- Goodness of fit test
- Time-Dependent Covariates
Graphical Approach 1: Log-Log Plots
The two graphical approaches for checking the PH assumption are comparing log-log survival curves and comparing observed versus expected survival curves. We first explain what a survival curve is and how it is used.
A log-log survival curve is simply a transformation of an estimated survival curve that results from taking the natural log of an estimated survival probability twice. Mathematically, we write a log-log curve as . Note that the log of a probability such as is always a negative number. Because we can only take logs of positive numbers, we need to negate the first log before taking the second log. The value for may be positive or negative, either of which is acceptable, because we are not taking a third log.
- is negative is positive
- can't take log of , but can take log of
- may be positive or negative
As an example, in the graph below, the estimated survival probability of is transformed to a log-log value of . Similarly, the point on the survival curve is transformed to a value of .
Note that because the survival curve is usually plotted as a step function, so will the log-log curve be plotted as a step function.
To illustrate the computation of a log-log value, suppose we start with an estimated survival probability of . Then the log-log transformation of this value is , which is , because equals . Now, equals , because equals . Thus, the transformation equals .
As another example, if the estimated survival probability is , then equals , which equals .
Why the PH assumption can be assessed by evaluating whether or not log-log curves are parallel.
Recall the formula for the survival curve that corresponds to the hazard function for the Cox PH model. Recall that there is a mathematical relationship between any hazard function and its corresponding survival function. We therefore can obtain the formula shown here for the survival curve for the Cox PH model. In this formula, the expression denotes the baseline survival function that corresponds to the baseline hazard function .
The log-log formula requires us to take logs of this survival function twice. The first time we take logs we get the expression shown here.
Now since denotes a survival probability, its value for any and any specification of the vector will be some number between 0 and 1. It follows that the natural log of any number between 0 and 1 is a negative number, so that the log of as well as the log of are both negative numbers. This is why we have to put a minus sign in front of this expression before we can take logs a second time, because there is no such thing as the log of a negative number.
Thus, when taking the second log, we must obtain the log of , as shown here. After using some algebra, this expression can be rewritten as the sum of two terms, one of which is the linear sum of the and the other is the log of the negative log of the baseline survival function.
(probability) negative value, so and are negative. But is positive, which allows us to take logs again.
or
Now suppose we consider two different specifications of the vector, corresponding to two different individuals, and .
Two individuals:
Then the corresponding log-log curves for these individuals are given as shown here, where we have simply substituted and for in the previous expression for the log-log curve for any individual .
does not involve
Subtracting the second log-log curve from the first yields the expression shown here. This expression is a linear sum of the differences in corresponding predictor values for the two individuals. Note that the baseline survival function has dropped out, so that the difference in log-log curves involves an expression that does not involve time .
Alternatively, using algebra, we can write the above equation by expressing the log-log survival curve for individual as the log-log curve for individual plus a linear sum term that is independent of .
Therefore, if we use a Cox PH model and we plot the estimated log-log survival curves for individuals on the same graph, the two plots would be approximately parallel. The distance between the two curves is the linear expression involving the differences in predictor values, which does not involve time. Note, in general, if the vertical distance between two curves is constant, then the curves are parallel.
The parallelism of log-log survival plots for the Cox PH model provides us with a graphical approach for assessing the PH assumption. That is, if a PH model is appropriate for a given set of predictors, one should expect that empirical plots of log-log survival curves for different individuals will be approximately parallel.
Note: By empirical plots, we mean plotting log-log survival curves based on Kaplan-Meier (KM) estimates that do not assume an underlying Cox model.
Example
Consider the comparison of treatment and placebo groups in a clinical trial of leukemia patients, where survival time is time, in weeks, until a patient goes out of remission. Two predictors of interest in this study are treatment group status (1 = placebo, 0 = treatment), denoted as , and log white blood cell count (log WBC), where the latter variable is being considered as a confounder.
To assess whether the PH assumption is satisfied for either or both of these variables, we would need to compare log-log survival curves involving categories of these variables.
One strategy to take here is to consider the variables one at a time. For the variable, this amounts to plotting log-log KM curves for treatment and placebo groups and assessing parallelism. If the two curves are approximately parallel, as shown here, we would conclude that the PH assumption is satisfied for the variable . If the two curves intersect or are not parallel in some other way, we would conclude that the PH assumption is not satisfied for this variable.
For the log WBC variable, we need to categorize this variable into categories - say, low, medium, and high - and then compare plots of log-log KM curves for each of the three categories. In this illustration, the three log-log Kaplan-Meier curves are clearly nonparallel, indicating that the PH assumption is not met for log WBC.
Graphical Approach 2: Observed Versus Expected Plots
The use of observed versus expected plots to assess the PH assumption is the graphical analog of the goodness-of-fit (GOF) testing approach to be described later, and is therefore a reasonable alternative to the log-log survival curve approach.
As with the log-log approach, the observed versus expected approach may be carried out using either or both of two strategies-(1) assessing the PH assumption for variables one-at-a-time, or (2) assessing the PH assumption after adjusting for other variables. The strategy which adjusts for other variables uses a stratified Cox PH model to form observed plots, where the PH model contains the variables to be adjusted and the stratified variable is the predictor being assessed.
Here, we describe only the one-at-a-time strategy, which involves using KM curves to obtain observed plots.
Example
As an example, for the remission data on 42 leukemia patients we have illustrated earlier, the KM plots for the treatment and placebo groups, with 21 subjects in each group, are shown here. These are the "observed" plots.
To obtain "expected" plots, we fit a Cox PH model containing the predictor being assessed. We obtain expected plots by separately substituting the value for each category of the predictor into the formula for the estimated survival curve, thereby obtaining a separate estimated survival curve for each category.
To compare observed with expected plots we then put both sets of plots on the same graph as shown here.
If observed and expected plots are:
- close, complies with PH assumption
- discrepant, PH assumption violated
The Goodness of Fit (GOF)Testing Approach
The implementation of the test can be thought of as a three-step process.
Run a Cox PH model and obtain Schoenfeld residuals for each predictor.
Create a variable that ranks the order of failures. The subject who has the first (earliest) event gets a value of 1, the next gets a value of 2, and so on.
Test the correlation between the variables created in the first and second steps. The null hypothesis is that the correlation between the Schoenfeld residuals and ranked failure time is zero.
If p-value small, then departure from PH.
Assessing the PH Assumption Using Time-Dependent Covariates
A. Use extended Cox model: contains product terms of form , where is function of time, e.g., , or , or heaviside function.
B. One-at-a-time model:
Test using Wald or LR test (chi- square with .
C. Evaluating several predictors simultaneously:
where is function of time for th predictor.
Test using (chi-square) test with df.
Drawback:
- choice of not always clear
- different choices may lead to different conclusions about assumption.